class: center, middle, inverse, title-slide .title[ # ISA 401: Business Intelligence & Data Visualization ] .subtitle[ ## 01: Introduction to BI and Data Viz ] .author[ ###
Fadel M. Megahed, PhD
Raymond E. Gloss Professor in Business
Farmer School of Business
Miami University
@Fadel
Megahed
fmegahed
fmegahed@miamioh.edu
Automated Scheduler for Office Hours
] .date[ ### Spring 2026 ] --- # Learning Objectives for Today's Class - Describe the **analytics journey** and where BI fits within it. - Explore a **BI dashboard** and identify the questions it answers. - Define **data visualization** and describe its **main goals**. - Identify the **unit of analysis** in a dataset. --- class: inverse, center, middle # The Analytics Journey --- # The Analytics Journey: Pre-Analytics [1] - **Pre-Analytics/Data Management:** where one attempts to **extract** the needed *data* for analysis. Data can either be: .div[ .pull-left[ ## .center[.large[.large[.large[🥫]]]] * Stale, uninteresting, convenient * Highly processed and archived * Example: `iris`, `mtcars`, `titanic` ] .pull-right[ ## .center[.large[.large[.large[🍅]]]] * Fresh, interesting, challenging * Impactful * Examples: [Cincinnati Open Data Portal](https://data.cincinnati-oh.gov/), [Ohio Data Portal](https://data.ohio.gov/wps/portal/gov/data/), [US Government's Open Data](https://www.data.gov/). ] ] .footnote[ <html> <hr> </html> While the highly processed data can be useful in learning basic concepts, **real-world (often messy)** data real are much interesting to work with -- **e.g., we can make useful & meaningful decision from the data.** In this class, we will learn how to scrape, extract and clean messy data in addition to visualizing clean[ed] data. Source: Slide inspired by [Kia Ora's What I mean by "data"](https://stats220.earo.me/01-intro.html#6). ] --- # The Analytics Journey: Pre-Analytics [2] ### Non-Graded Class Activity #1
> _Take 4 minutes to discuss with your partner_ .panelset[ .panel[.panel-name[Activity] - Go to <https://data.cincinnati-oh.gov/Safety/Traffic-Crash-Reports-CPD-/rvmt-pkmq/data> - Download the data utilizing the export column and answer the following questions: * How many **observations/rows** and **columns** do we have in the dataset? * How many **crashes** are reported in the dataset? * **Key Question:** Is each row a crash, or something else? ] .panel[.panel-name[Your Solution] - .can-edit.key-activity1[Insert your solution here (Use Chrome as your browser to edit this part of the page)] ] .panel[.panel-name[Fadel's Approach (No Solution Shown)] ``` r if(require(tidyverse) == FALSE) install.packages("tidyverse") # Link obtained from site -> Export -> "Right Click on" CSV crashes = readr::read_csv("https://data.cincinnati-oh.gov/api/views/rvmt-pkmq/rows.csv?accessType=DOWNLOAD") # Number of rows and columns nrow(crashes) ncol(crashes) # Or alternatively dim(crashes) # Total number of crashes # Will be discussed in class in greater detail ``` ] ] --- # The Analytics Journey: Descriptive [1] **Descriptive Analytics:** where one attempts to **understand** the data through **descriptive statistics** and **visualizations**. ### Descriptive Statistics for 2 Categorical Variables .small[ ``` ## $dayofweek ## ## THU TUE SUN WED SAT FRI MON ## 4113 4030 2721 4233 3199 4256 3628 ## ## $weather ## ## 1 - CLEAR 2 - CLOUDY ## 18465 3627 ## 6 - SNOW 4 - RAIN ## 771 3004 ## 9 - FREEZING RAIN OR FREEZING DRIZZLE 99 - OTHER/UNKNOWN ## 10 252 ## 3 - FOG, SMOG, SMOKE 7 - SEVERE CROSSWINDS ## 29 3 ## 5 - SLEET, HAIL 8 - BLOWING SAND, SOIL, DIRT, SNOW ## 14 5 ``` ] --- # The Analytics Journey: Descriptive [2] **Descriptive Analytics:** where one attempts to **understand** the data through **descriptive statistics** and **visualizations**. ### A Simple Visualization - A Bar Chart of Crashes Per Day <img src="data:image/png;base64,#01_introduction_files/figure-html/viz-1.png" alt="" style="display: block; margin: auto;" /> --- # The Analytics Journey: Descriptive [3] **Descriptive Analytics:** where one attempts to **understand** the data through **descriptive statistics** and **visualizations**. <img src="data:image/png;base64,#01_introduction_files/figure-html/viz2a-1.png" alt="" style="display: block; margin: auto;" /> --- # The Analytics Journey: Continued .pull-left[ ### Predictive Analytics Using **statistical** and **machine learning** models to predict outcomes. **Success depends on:** - **Correct** data: Do you capture important predictors? - **Clean** data: Is it tidy, technically correct, consistent? **Recommendation:** Start simple, add complexity only if needed. ] -- .pull-right[ ### Prescriptive Analytics Using **mathematical models** to recommend business actions. **Example:** A trucking company modeling: - Events associated with crashes - Predictions based on driver, conditions **Question:** What would you optimize for? - Minimize crashes? - Minimize cost while maintaining safety? ] --- # How does our Curriculum at Miami University Prepare you for this Journey? <div class="figure" style="text-align: center"> <img src="data:image/png;base64,#../../figures/ba_flow_chart.png" alt="Fadel's take on our ISA curriculum" width="100%" /> <p class="caption">My take on the courses within the business analytics major/minor at Miami University</p> </div> --- class: inverse, center, middle # Course Design, Expectations, and Overview (Will not be covered in class; I presume you read the syllabus and the Week 0 module before our first class. ) --- # ISA 401 Course: An Overview <div class="figure" style="text-align: center"> <img src="data:image/png;base64,#../../figures/course_overview.png" alt="How the ISA 401 course is organized." width="100%" /> <p class="caption">How the ISA 401 course is organized.</p> </div> --- # ISA 401 Course Objectives Even though software will be extensively used, this is not a software class. **Instead, the focus is on understanding the underlying methods and mindset of how data should be approached.** - Be capable of extracting, transforming and loading (ETL) data using multiple platforms (e.g. <svg viewBox="0 0 581 512" style="height:1em;position:relative;display:inline-block;top:.1em;fill:#C3142D;" xmlns="http://www.w3.org/2000/svg"> <path d="M581 226.6C581 119.1 450.9 32 290.5 32S0 119.1 0 226.6C0 322.4 103.3 402 239.4 418.1V480h99.1v-61.5c24.3-2.7 47.6-7.4 69.4-13.9L448 480h112l-67.4-113.7c54.5-35.4 88.4-84.9 88.4-139.7zm-466.8 14.5c0-73.5 98.9-133 220.8-133s211.9 40.7 211.9 133c0 50.1-26.5 85-70.3 106.4-2.4-1.6-4.7-2.9-6.4-3.7-10.2-5.2-27.8-10.5-27.8-10.5s86.6-6.4 86.6-92.7-90.6-87.9-90.6-87.9h-199V361c-74.1-21.5-125.2-67.1-125.2-119.9zm225.1 38.3v-55.6c57.8 0 87.8-6.8 87.8 27.3 0 36.5-38.2 28.3-87.8 28.3zm-.9 72.5H365c10.8 0 18.9 11.7 24 19.2-16.1 1.9-33 2.8-50.6 2.9v-22.1z"></path></svg>, Power BI and/or Tableau). - Write basic scripts to preprocess and clean the data. - Explore the data using visualization approaches that are based on sound human factors. - Understand how statistical/machine learning can capitalize on the insights generated from the data visualization process. - Create interactive dashboards that can be used for business decision making, reporting and/or performance management. - Be able to apply the skills from this class in your future career. --- # Instructional Approach <div class="figure" style="text-align: center"> <img src="data:image/png;base64,#../../figures/instructional_approach.png" alt="An overview of the instructional approach for ISA 401." width="100%" /> <p class="caption">An overview of the instructional approach for ISA 401.</p> </div> --- # How will I Evaluate your Learning? <div class="figure" style="text-align: center"> <img src="data:image/png;base64,#../../figures/evaluation.png" alt="An overview of the evaluation components for ISA 401." width="100%" /> <p class="caption">An overview of the evaluation components for ISA 401.</p> </div> --- class: inverse, center, middle # Business Intelligence: From Data to Decisions --- # What is Business Intelligence? > "... to enable **interactive access (sometimes in real time)** to data, to enable manipulation of data, and to give business managers and analysts the ability to conduct appropriate analysis. By analyzing ... data, situations, and performances, decision makers get valuable insights that enable them to **make more informed and better decisions** ... BI is based on the **transformation of data to information, then to decisions, and finally to actions.**" <img src="data:image/png;base64,#../../figures/stock_market.JPG" alt="A schematic of an interactive BI tool for stock market prediction" width="55%" style="display: block; margin: auto;" /> .footnote[ <html> <hr> </html> **Quote** from Sharda, R., Delen, D., & Turban, E. (2013). Business Intelligence: A managerial perspective on analytics. Prentice Hall Press. **Image Credit:** Joint work with Bin Weng. ] --- # The BI Process <img src="data:image/png;base64,#../../figures/bi_process.jpg" alt="A schematic of the different components of the business intelligence (BI) process" width="73%" style="display: block; margin: auto;" /> .footnote[ <html> <hr> </html> **Image Credit:** Sharda, R., Delen, D., & Turban, E. (2013). Business Intelligence: A managerial perspective on analytics. Prentice Hall Press. ] --- # Job Market Explorer: A BI Dashboard in Action <iframe src="job_market_explorer_standalone.html" width="100%" height="480px" data-external="1"></iframe> .footnote[ **Data:** 1,276 federal job postings from USAJobs.gov, pulled on January 14, 2026. ] --- # Regarding the Job-Market Dashboard
.panelset[ .panel[.panel-name[Activity] ### Non-Graded Class Activity #2 .small[ You just explored a job-market dashboard. Read the dashboard carefully and then answer the questions below. Write **full, specific questions. Think like both a decision-maker and an analyst.** **Slide 1: Use it** - What is *one question you would want this dashboard to answer* to help you make a career decision? **Slide 2: Challenge it** - What is *one question you would ask about the data itself* before trusting the answer? ] ] .panel[.panel-name[Your Solution] <div style="height:400px; margin:0; padding:0; overflow:hidden;"> <iframe src="https://www.mentimeter.com/app/presentation/albckjyfoamn1927ubw9urpzik287ya4/embed" frameborder="0" allowfullscreen sandbox="allow-scripts allow-same-origin allow-presentation" style="width:100%; height:100%; border:0;"> </iframe> </div> ] ] --- # Data Source Mapping: BI Requires Merging Datasets | Question Type | Data Needed | Possible Source | Data Type | |:---|:---|:---|:---| | Salary | Wages by occupation | BLS, Glassdoor | Structured (CSV) | | Skills | Job requirements | Job postings | Unstructured (Text) | | Location | Job counts by city | Indeed, LinkedIn, USAJobs | Semi-structured (HTML/API) | | Trends | Historical postings | Archives, APIs | Time series | .footnote[ <html> <hr> </html> **Key insight:** No single source answers everything. Real analysis requires combining data from multiple places—and that's exactly what we'll learn to do. ] --- # From Data to Decisions: The BI Value Chain .pull-left[ ### The Journey | Stage | Question | Output | |:---:|:---|:---| | **Data** | What happened? | Raw facts | | **Information** | What does it mean? | Context + structure | | **Knowledge** | Why did it happen? | Patterns + insights | | **Decision** | What should we do? | Action | ] -- .pull-right[ ### Example: Job Market | Stage | Example | |:---:|:---| | **Data** | 1,276 job postings from USAJobs | | **Information** | Data & Analytics roles have median salary of $139K | | **Knowledge** | D&A roles pay $33K more than overall median | | **Decision** | Pursue Data & Analytics career paths | ] .footnote[ <html> <hr> </html> This is the journey we'll take this semester: from raw data to actionable insights. .small[ **Calculation:** Median salary for D&A roles ($138,667; n=21) minus overall median ($105,469; n=1,276). ] ] --- class: inverse, center, middle # So What is Data Visualization? --- # What is Data Visualization? Data visualization involves **presenting data in a graphical format**. It is really a process that starts by getting data, creating initial plot(s) and modifying them to answer questions of interest (and possibly making the plot aesthetically pleasing). For example, see [Cedric Scherer's visualization of the UNESCO data on global student to teacher ratios](https://www.cedricscherer.com/2019/05/17/the-evolution-of-a-ggplot-ep.-1/). <img src="data:image/png;base64,#https://d33wubrfki0l68.cloudfront.net/1e7033393a2c70dc1255c5d0f1c563e945519251/61035/img/evol-ggplot/evol-ggplot-1.gif" alt="" width="58%" style="display: block; margin: auto;" /> --- # The Goals of Data Visualization - **Record** information - **Analyze** data to support reasoning * Develop and assess hypotheses (EDA) * Reveal patterns * Discover errors in data - **Communicate** ideas to others * Infographics * Statistic charts * Interactive charts * Dashboards - **Interact with the data (which supports all the above)** --- # Record Information <iframe src="https://flo.uri.sh/visualisation/17077589/embed?auto=1" width="100%" height="480px" data-external="1"></iframe> .footnote[ <html> <hr> </html> Source: Flourish Studio (2024). [When tables are the most effective form of data visualization](https://flourish.studio/blog/how-make-interactive-table/). ] --- # Analyze Data: The 1854 Cholera Outbreak <div class="figure" style="text-align: center"> <img src="data:image/png;base64,#../../figures/cholera.jpg" alt="The physician John Snow, dealing with a Cholera outbreak plotted the cases on a map of the city (see schematic above)." width="50%" /> <p class="caption">The physician John Snow, dealing with a Cholera outbreak plotted the cases on a map of the city (see schematic above).</p> </div> .footnote[ <html> <hr> </html> Source: Leskovec, J., Rajaraman, A., & Ullman, J. D. (2020). Mining of Massive Data Sets (Third Edition). Cambridge University Press. Image is from Chapter 1, which can be accessed [here](http://infolab.stanford.edu/~ullman/mmds/ch1n.pdf). ] --- # Communicate Ideas
.panelset[ .panel[.panel-name[Activity] .pull-left[ ### Non-Graded Class Activity #3 .small[ - What **data** is represented in this visualization? Be specific about the unit of analysis. - How is the data **visually encoded**? (position, size, color, shape, etc.) - Is this encoding **effective**? Why or why not? ] ] .pull-right[ <img src="data:image/png;base64,#../../figures/wpost.jpg" alt="" width="77%" style="display: block; margin: auto;" /> ] ] .panel[.panel-name[Your Solution] - .can-edit.key-activity3[Insert your solution here (Use Chrome as your browser to edit this part of the page)] ] ] --- # Interact: GapMinder/ Hans Rosling Example <html> <center> <div style="max-width:854px"><div style="position:relative;height:0;padding-bottom:56.25%"><iframe src="https://embed.ted.com/talks/lang/en/hans_rosling_the_best_stats_you_ve_ever_seen" width="854" height="480" style="position:absolute;left:0;top:0;width:100%;height:100%" frameborder="0" scrolling="no" allowfullscreen></iframe></div></div> </center> </html> --- class: inverse, center, middle # Recap --- # Summary of Main Points By now, you should be able to do the following: - Describe the **analytics journey** (pre-analytics, descriptive, predictive, prescriptive) and where **BI** fits within it. - Explore a **BI dashboard** and identify the questions it can (and cannot) answer. - Define **data visualization** and describe its main goals (record, analyze, communicate, interact). - Identify the **unit of analysis** in a dataset (rows vs. observations). - Recognize how data is **visually encoded** (position, size, color, shape). --- # 📝 Review and Clarification 📝 .pull-left[ 1. **Class Notes**: Take some time to revisit your class notes for key insights/concepts. 2. **Zoom Recording**: The recording of today's class will be made available on Canvas approximately 3-4 hours after the session ends. 3. **Review AI Recaps:** The video and podcast are both generated from our slide deck and are meant to reinforce the same core ideas. 4. **Questions**: Do not let confusion pile up. ] .pull-right[ **📺 Slide Deck Video (click to play)** <iframe width="100%" height="230" src="https://www.youtube.com/embed/20cFIOJBc3g?autoplay=0" title="NotebookLM Slide Deck Review" frameborder="0" allow="accelerometer; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen> </iframe> <br> **🎧 Slide Deck Podcast (click to play)** <iframe src="https://drive.google.com/file/d/1lt6hqrMuW8JOy9ilvkT5NVlPABowj7SE/preview" width="100%" height="70" allow="clipboard-write"> </iframe> ] --- # 📖 Required Readings: R Prep 📖 .pull-left[ .font90[ #### 📈 R Prep - [Workflow: Basics](https://r4ds.had.co.nz/workflow-basics.html) - [Names and Values](https://adv-r.hadley.nz/names-values.html) - [Vectors](https://adv-r.hadley.nz/vectors-chap.html) - [Subsetting](https://adv-r.hadley.nz/subsetting.html) ] ] .pull-right[ .font90[ **📺 Optional Video (R Prep)** <iframe width="100%" height="260" src="https://www.youtube.com/embed/uFe-6KaceJk?autoplay=0" title="R Prep Review Video" frameborder="0" allow="accelerometer; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen> </iframe> ] ] --- # 📖 Required Readings: LLM Prep 📖 .pull-left[ .font90[ #### 🤖 LLM Prep - [A Very Gentle Introduction to Large Language Models without the Hype](https://mark-riedl.medium.com/a-very-gentle-introduction-to-large-language-models-without-the-hype-5f67941fa59e) ] ] .pull-right[ .font90[ **📺 Optional Video (LLM Prep)** <iframe width="100%" height="260" src="https://www.youtube.com/embed/F5Gs6-C0DiA?autoplay=0" title="LLM Prep Review Video" frameborder="0" allow="accelerometer; clipboard-write; encrypted-media; gyroscope; picture-in-picture" allowfullscreen> </iframe> ] ] --- # 🎯 Assignment 🎯 - Complete [Assignment 01](https://miamioh.instructure.com/courses/250351/quizzes/774525) on Canvas to reinforce your understanding and application of the topics covered today as well as the assigned readings. <div style="background: #e0f2fe; border-left: 4px solid #0ea5e9; padding: 15px 20px; margin-bottom: 25px; border-radius: 0 8px 8px 0;"><strong style="color: #0369a1;">Before You Begin:</strong> <p style="margin: 10px 0 0 0; font-size: 14px;">This assignment has three parts. Make sure you have reviewed:</p> <ul style="margin: 10px 0 0 0; padding-left: 20px; font-size: 14px;"> <li>Class 01 slide deck and/or Zoom recording</li> <li>NotebookLM summaries: <ul style="margin: 10px 0 0 0; padding-left: 20px; font-size: 14px;"> <li><a style="color: #c41230;" href="https://youtu.be/20cFIOJBc3g" target="_blank" rel="noopener"><strong>01_BI__Data_to_Decisions.mp4</strong></a></li> <li><a style="color: #c41230;" href="https://drive.google.com/file/d/1lt6hqrMuW8JOy9ilvkT5NVlPABowj7SE/view?usp=sharing" target="_blank" rel="noopener"><strong>01_Turning_Messy_Data_Into_Business_Strategy.m4a</strong></a> (podcast, you may want to download first please)</li> </ul> </li> <li>Required readings for Class 02 (listed at the end of Class 01 slides)</li> <li>NotebookLM summary: <a style="color: #c41230;" href="https://youtu.be/F5Gs6-C0DiA" target="_blank" rel="noopener"><strong>02_A_Gentle_Intro_to_LLMs.mp4</strong></a></li> </ul> </div>